Deriving Semantic Knowledge from Descriptive Texts Using an MT System

نویسندگان

  • Eric Nyberg
  • Teruko Mitamura
  • Kathryn L. Baker
  • David Svoboda
  • Brian Peterson
  • Jennifer Williams
چکیده

This paper describes the results of a feasibility study which focused on deriving semantic networks from descriptive texts using controlled language. The KANT system 3, 6] was used to analyze input paragraphs, producing sentence-level interlingua representations. The in-terlinguas were merged to construct a paragraph-level representation, which was used to create a semantic network in Conceptual Graph (CG) 1] format. The interlinguas are also translated (using the KANTOO generator) into OWL statements for entry into the Ontology Works electrical power factbase 9]. The system was extended to allow simple querying in natural language.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

Contextual Bitext-Derived Paraphrases in Automatic MT Evaluation

In this paper we present a novel method for deriving paraphrases during automatic MT evaluation using only the source and reference texts, which are necessary for the evaluation, and word and phrase alignment software. Using target language paraphrases produced through word and phrase alignment a number of alternative reference sentences are constructed automatically for each candidate translat...

متن کامل

The experimental MT System of the Project KIT-FAST

Within the project KIT-FAST an experimental machine translation (MT) system has been developed and implemented, which translates written German texts into English. For that reason a syntactic, semantic and aspects of a conceptual level of representation have been realized. In general each level has three dimensions, which are a sentence and a text representation, which are constructed during tr...

متن کامل

Word Sense Disambiguation: Why Statistics When We Have These Numbers?

Word sense disambiguation continues to be a di cult problem in machine translation (MT). Current methods either demand large amounts of corpus data and training or rely on knowledge of hard selectional constraints. In either case, the methods have been demonstrated only on a small scale and mostly in isolation, where disambiguation is a task by itself. It is not clear that the methods can be sc...

متن کامل

Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases

Untranslated words still constitute a major problem for Statistical Machine Translation (SMT), and current SMT systems are limited by the quantity of parallel training texts. Augmenting the training data with paraphrases generated by pivoting through other languages alleviates this problem, especially for the so-called “low density” languages. But pivoting requires additional parallel texts. We...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002